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Entropy and mixing 
for amenable group actions 

By Daniel J. Rudolph and Benjamin Weiss* 
Abstract 

For r a countable amenable group consider those actions of T as measure- 
preserving transformations of a standard probability space, written as {T 7 } 7g r 
acting on (X, J 7 , fi). We say {T 7 } 7g r has completely positive entropy (or simply 
cpe for short) if for any finite and nontrivial partition P of X the entropy 
h(T, P) is not zero. Our goal is to demonstrate what is well known for actions 
of Z and even Z rf , that actions of completely positive entropy have very strong 
mixing properties. Let Si be a list of finite subsets of T. We say the Si spread 
if any particular 7 7^ id belongs to at most finitely many of the sets SiS^ 1 . 

Theorem 0.1. For {T 7 } 7g r o- n action ofT of completely positive entropy 
and P any finite partition, for any sequence of finite sets Si C T which spread 
we have 

jToM V T i(P))^(P). 
ifSi jeSi % 

The proof uses orbit equivalence theory in an essential way and repre- 
sents the first significant application of these methods to classical entropy and 
mixing. 

1. Introduction 

The goal of this work is to lift a part of the theory of iv"-actions in the 
class of measure-preserving transformations of standard probability spaces to 
actions of countable amenable groups in particular to show that they must 
be mixing and in fact mixing of all orders. The standard proof of this for 
transformations (actions of Z in our vocabulary) or more generally for actions 
of Z rf relies on the existence of the past- algebra and tail- algebra of a partition 
relative to the action. It seems there is no such good notion for actions of 
general groups, in particular discrete amenable groups. There are two reasons 
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for working with actions of countable amenable groups. First there is a rather 
good analogue of the entropy theory of Z actions for them. This is found in 
[7] and this as well as the general formulation there of a Rokhlin lemma for 
such actions are the technical background ideas we will use. Second, these 
actions are characterized in [1] as precisely those which are orbit-equivalent to 
actions of Z. We use such an orbit equivalence in an essential way to transfer 
our problem from T to Z. Neither entropy nor any mixing properties are orbit 
equivalence invariants. On the face of it this would seem to block the use of such 
an orbit equivalence in this area. We avoid this seeming obstacle by working 
with entropy and mixing properties relative to a sub a-algebra and with orbit 
equivalences that are measurable with respect to this sub u-algebra. We show 
the former are invariants of the latter. This is the first significant application 
of orbit equivalence theory to classical entropy and mixing questions. 

We point out that J. Kieffer [5] constructed a version of the Shannon- 
McMillan Theorem for amenable group actions predating the work of [7] by 
the introduction of a generalized tail- field. T. Ward and Q. Zhang in [9] use this 
method to obtain a conditional entropy theory for such actions. This approach 
does not appear able to directly give Theorem 2.3 via this generalized tail-field. 
The work of Ward and Zhang does give essentially all the results we need for 
conditional entropy. We give an alternative approach to these results anyway 
building on the methods of [7] rather than those of [5] . 

In the next section we give a detailed discussion of our results and the 
proof of Theorem 2.3 except for two results (Theorems 2.6 and 2.11) that 
require rather extended argument. 

In Section 3 we lay out the theory of Rokhlin towers for actions of count- 
able amenable groups, as developed in [7] and the basic geometry of transfer- 
ence leading to Theorem 2.11 on spread sets and a dual result on invariant sets. 
These results are the tools that allow us to pull entropy and mixing properties 
through an orbit equivalence. In Section 4 we demonstrate the basic entropy 
theory we will need to prove Theorem 2.6. As promised we also include the 
simple argument that the direct product of a cpe action and a Bernoulli ac- 
tion is relatively cpe over the Bernoulli coordinate. This will complete all the 
technicalities for the proof of Theorem 2.3. In the final section we discuss a 
number of natural extensions of our work here. These include generalizing our 
work here to actions of continuous groups and several questions concerning 
amenable groups that might be answerable using orbit equivalence methods. 

2. Statements of results and proof of main theorem 

Let r be a countable discrete amenable group. For our purposes the most 
natural meaning for this is that T possesses a Folner sequence of sets Hi. We 
will go into more detail on this later. Suffice it here to say that these provide 



AMENABLE CPE- ACTIONS MIX 



1121 



a good analogue for intervals, squares, or boxes of indices in Z d for use as both 
averaging sets for ergodic theorems and for index sets for Rokhlin-like lemmas. 
Suppose T = {T 7 } 7e r is an action of T by measure-preserving transformations 
on a standard probability space (X, J 7 , fx). If P is a finite partition of X one 
can define the entropy of the process (T, P) as 

1 

That this limit exists and is independent of the choice of F0lner sequence is a 
part of the standard machinery found in [7] . 



h(T, P) = lim inr h(_ y jr T 7 -i (P)). 



Definition 2.1. We say a V- action T has completely positive entropy (or 
simply is cpe for short) if for any nontrivial finite partition P, 

h(T,P) > 0. 

This notion is analogous to one of the many definitions of K-ness of a 
single measure-preserving transformation. As said earlier we use this more 
precise vocabulary to avoid confusion among all these definitions. Our goal 
is to demonstrate that if T is cpe then it must be mixing of all orders. We 
will demonstrate something somewhat stronger, that there is a uniformity in 
this that provides a condition equivalent to cpe. We now formulate this. In a 
countable group V we say a sequence of sets Si spreads if each 7 / id belongs 
to at most finitely many of the sets SiS^ 1 and for a finite set K C T we say 
S is -fT-spread if for any 73 7^ 72 £ S we have 7172" 1 ^ K. If T is Z, for a 
sequence of sets to spread simply means the gaps between consecutive terms 
in the sets grow. The following result is well known for cpe actions of a single 
transformation and is what we will generalize here. 

Theorem 2.2. For T a measure-preserving transformation, T is a 
K- automorphism (is cpe) if and only if for any finite partition P and e > 
there is an N so that for any finite subset S = {si < S2 < ■ ■ ■ < s t } C Z with 
■Sj+i — Si > N for all i, 



< e. 



In Theorem 2.2 the conclusion can be written in a couple of equivalent 
forms. First a form that is a priori weaker: 

±h(y jeS T-*(P))>h{P)-e 

and now a form that is a priori stronger: 

d t (V jeS T- ] {P);B t (P)) <e 
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where B t (P) is an independent and identically distributed sequence of t random 
variables each with the distribution of P. It is quite standard to show that 
these three are all in fact equivalent. The argument is on finite lists of random 
variables and hence remains true for actions of countable groups. For actions 
of a single transformation T one can state another equivalent condition, that 
the sequence of processes (T n ,P) converge in d to B(P), the independent and 
identically distributed process with time-zero distribution that of P. 

This last formulation may not make sense for an arbitrary discrete amenable 
group, as it may not have spread co-finite subgroups. But if it does we will be 
able to reach this conclusion as well. 

Here is the goal of our work: 

Theorem 2.3. For T a cpe action of the discrete amenable group T and 
any finite partition P and e > there is a finite subset K C T so that for any 
finite set S that is K-spread 

1 



-My yeS T^-i(P))-h(P) 



< e. 



To demonstrate Theorem 2.3 we "transfer" the problem to actions of Z 
by the fundamental result of [1], that any discrete amenable group action is 
orbit-equivalent to an action of Z and by our development here of a relative 
entropy theory for actions of V. 

Definition 2.4. For T a measure-preserving action of T, A a T-invariant 
sub-u-algebra and P a finite partition we define the conditional entropy of the 
process (T, P) given the algebra A to be 

h(T, P\A) = inf{/t(T, P V Q) - h(T, Q) : Q is ^-measurable} . 

This is of course just the standard definition for actions of Z. We can also 
consider orbit equivalences that are „4-measurable. 

Definition 2.5. Let T be a measure-preserving and free action of T and 
A a T-invariant sub-cr-algebra. Suppose S is a free action of some perhaps 
different group V but having the same orbits as T. We say the orbit change 
from T to S is A-measurable if for each 7' G V the functions 7(3;) defined by 

T l{x) {x) = Sy(x) 

are ^.-measurable. 

For example, suppose T is actually a direct product of two free T-actions 
T\ x T2 and there is an action U of some V with the same orbits as T2 . The 
action of U then can be lifted uniquely to an action U with the same orbits 
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as T\ x T 2 . If Uj'(x2) = T 2 ^(x2)( x 2) then we can (and must) set £/y(xi,X2) = 
(Ti x r2) 7 ( X2 )(xi,X2). In this case it is clear that the orbit change from T\ x T 2 
to U is measurable with respect to the second coordinate cr-algebra £> 2 . 

Note also that if the orbit change from T to U is .A-measurable then A is 
a [/-invariant sub-cr-algebra. 

With these comments in mind we can now state the core technical fact we 
will use to obtain Theorem 2.3. 

Theorem 2.6. Suppose T is a free and ergodic action of a countable and 
discrete amenable group V and A is a T-invariant sub-a- algebra. Suppose also 
that U is a free (and necessarily ergodic) action of F' with the same orbits 
as T (r' is necessarily amenable). Suppose the orbit change from T to U is 
A-measurable. Then for any finite partition P we conclude 

h{T,P\A) = h(U,P\A). 

Notice that this result is in sharp contrast to the absolute case. All ergodic 
actions of Z are orbit-equivalent so entropy itself cannot be invariant. Here that 
we deal with conditional entropy and require the orbit change to be measurable 
with respect to the sub-cr-algebra on which we condition changes the situation 
entirely. In this situation the orbit change no longer has the freedom to modify 
the conditional entropy, that is to say the entropy that T has above and beyond 
the entropy on the sub-cr-algebra. 

With this result in hand and T some cpe action of T we "transfer" the 
problem to Z by considering a direct product action 

T x T 2 = {T 7 x T 2i7 } 7er , 

where all we really need of T 2 is that this direct product should still be ergodic 
and the action of T 2 should be free and hence orbit-equivalent to an action of 
Z. As described above this means the orbit equivalence of T 2 to some Z-action 
U will lift to an ergodic map U giving a Z action orbit-equivalent to T x T 2 . 
Moreover the orbit change from T x T 2 to U is measurable with respect to its 
second coordinate. 

In the direct product of T and T 2 consider the relative Pinsker algebra over 
the second coordinate, that is to say the cr-algebra generated by all partitions 
P with h(T x T 2 , -P|£> 2 ) = 0. When this algebra is just i3 2 the action is said to be 
relatively cpe over £> 2 . This relative Pinsker algebra contains both £> 2 and the 
Pinsker algebra of T. It is a general fact in a direct product like this that the 
relative Pinsker algebra is precisely the span of these two algebras, (see [2]). 
When T 2 is a Bernoulli action there is a particularly easy proof so we include 
this (see Corollary 4.11). In our case this means that the Pinsker algebra of 
T x T 2 is just that of T 2 which is to say T x T 2 is relatively cpe over £? 2 . 



1124 



DANIEL J. RUDOLPH AND BENJAMIN WEISS 



We conclude that U must be relatively cpe (relatively K) over the algebra 
£>2 as well. The theory of relative K-ness for actions of Z is well-developed 
and parallels completely the nonrelative case. We will quite regularly write 
expressions of the form E(P\A) where P is a finite partition and A is a sub-a- 
algebra. Consider P as a map from X to the labels {1,2,..., n}. By E(P\A) 
we will mean the probability vector 



For U an ergodic measure-preserving transformation, A a ^/-invariant sub- 
u-algebra, P a finite partition, and S a finite subset of Z we define 



L{V l&s U~\P)\A) = H{E{V l£S U~\P)\A)) = E{I{V ieS U~\P)\A)\A). 



That is to say, we calculate the conditional probabilities given A of the various 
sets in Vj G s£7 _J (P) and then take the entropy of this probability vector. (For 
our purposes h calculates the entropy of a partition relative to a fixed measure 
and H calculates the entropy of a probability vector.) 

The following result, due to M. Rahe [8] is fundamental to obtaining our 
result. Notice that when A is the trivial algebra it reduces to Theorem 2.2. 

Theorem 2.7 (M. Rahe [8]). Suppose U is an ergodic measure-preserving 
transformation, A is a IJ -invariant sub-a- algebra, and U is relatively K with 
respect to A. Then for any finite partition P and e > there is an N so that 
for any finite subset SCM with \si — Sj\ > N for all i / j, 



Notice that in the conclusion all the information functions are ^.-measur- 
able but as stated the set S is a constant over the space. We need to consider 
the possibility that S = S(x) is an „4-measurable choice of the set of indices 
along which the calculation is made. We also will need to loosen up the con- 
dition that all gaps in the sequence are at least N. 

Definition 2.8. Suppose S(x) is a measurable choice of a fc-element subset 
of Z. We say S(x) is iV-quasi-spread if for all x outside a subset of measure 
less than 1/N, there is a subset S'(x) C S(x) with #S'(x)/#S(x) > 1 - 1/N 
and for all distinct s, s' £ S'(x) we have 



We can generalize this idea to a countable discrete group T by fixing a 
listing of its elements 71,72,... = T. Suppose S(x) = {si(x), . . . , Sk(x)} is 
a measurable choice of a /c-element subset of T. We say S(x) is A^-quasi- 
spread if for all x outside a subset of measure less than 1/N, there is a subset 



{E(l P - 1{1) \A),...,E(l P - 1(n) \A)}. 




s-s'\> N. 
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S'{x) C S(x) with #S'(x)/#S(x) > 1 - l/N and for all distinct s,s' e S"(ar) 
we have 

s~V ^ {71,72, • • ■ ,7at}- 

Although this latter definition depends explicitly on how we choose to list 
the elements of the group it is easy to see that if we fix two listings of the 
group, to be well quasi-spread for one listing is to be well quasi-spread for the 
other. 

We conjecture that just assuming that S(x) is sufficiently quasi-spread will 
not be enough to prove Theorem 2.7. Our proof requires us to ask something 
else of the function that is automatically true for a constant set. 

Definition 2.9. For S(x) a measurable choice of /c-element subset of T we 
say S is uniform relative to the T-action T if there are k elements Wi = T s .^(x) 
in the full-group of T (i.e. the Wi are one-to-one, onto and measure-preserving) 
such that 

S(x) = {si(x),... ,s k (x)}. 
Notice that this implies the k image points Wi(x) are distinct. 

The following simple observation is one piece of our transference, saying 
that the notion of uniformity transfers through a rearranging of the orbit. 

Lemma 2.10. Suppose T is a free and ergodic action of the countable 
group T and U is an action of another countable group T' with the same orbits 
as T. Suppose S(x) is a k- element set-valued function of X taking values in T 
and uniform. Let Wi(x) be k maps whose set of images form the S(x). Each 
Wi can be written as 

W i (x) = U Vi{x) (x). 

Letting V(x) = {v±(x), V2(x), . . . , Vk(x)}, the set-valued function V(x) takes 
values in the k-point subsets of V and is also uniform. 

This piece of the transference is transparent. The other piece will require 
proof. 

Theorem 2.11. Suppose T is a free action of the countable group T = 
{7i)72)---} an d U is a free action of T' = {71,72, •••} with the same orbits. 
Given any N 6 N there is an M G N so that for any k and S(x) taking values 
in the k-point subsets ofT that is uniform and M- quasi- spread, the set-valued 
function V(x), taking values in the k-point subsets of T' will be uniform and 
N- quasi- spread. 

We postpone the proof of Theorem 2.11 to Section 2. 
In Theorem 2.7 it is enough to ask that the set S be iV-quasi-spread as 
the indices outside S' will contribute only an error on the order of log jfP/N 
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to each term in the difference. What is not at all clear though is that we can 
replace a constant set S with a variable S(x). We will see though that under 
the assumption of uniformity we can. 

Theorem 2.12. Suppose U is an ergodic measure-preserving transfor- 
mation, A is a U -invariant sub-a- algebra, and U is relatively K with respect 
to A. Then for any finite partition P and e > there is an N so that for any 
A-measurable function S taking values in the k-point subsets ofTL that is both 
N- quasi- spread and uniform we will have 




Proof. To say that U is relatively K with respect to the cr-algebra A is to 
say that any partition P has a trivial relative tail. That is to say, 

p|(V U~ i {P)\/A)=A 

N i>N ' 

and hence 

lim E{P\ \/i> N U'\P) V A) = E{P\A). 

N—*tx> 

Assume P is an n-set partition 

B N = {x: \L(P\ Vi> N U~\P) V A)(x) - L(P\A)(x)\ < e/(4Iogn)}. 

Choose N sufficiently large that /x(5jv) > 1 — e/(41ogn) and that 1/N 
< e/(41ogn). 

Now assume S(x), taking values in the fc-point subsets of Z, is both 
./V-quasi-spread and uniform. For each x G X rewrite S(x) as si(x) > s 2 (x) > 
■ ■ ■ > s t ( x j(x), s t ( x } + i(x), . . . , Sfc(x) where the ordering is chosen so that t(x) is 
maximal for the properties 

1) for i < t(x), Si(x) — Si + i(x) > N, and 

2) for i < t{x), U Si( - x \x) e B N . 

Notice that as S(x) is ^.-measurable as is Bn, we can assume that both Si(x) 
and t(x) are as well. 

As S is uniform we can calculate 

J #{ Si {x) : U Sl(x) {x) £ B N }dfi> fc(l-e/(41ogn)) 

and letting S'(x) C S(x) be a maximal set of indices pairwise more than N 
apart, as S is iV-quasi-spread 

J #S'(x) dn > k(l - e/(2 log n)). 
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We conclude that 

J t{x) dfi > fc(l - 3e/(41ogn)). 

We now calculate L(\/ ieS / x \U~ l (P)\A) one term at a time in the form 
k 

Y^L{U s ^ x) {P)\ U S ^ X \P)\JA). 
i=i 



We immediately conclude that 



L{V ieS{x )U-\P)\A) <J2Hu- Si{x) (P)\A) 



i=i 

and so we only need an inequality in the other direction. 
The following inequalities are true almost surely in x. 

k 

t ]T L{U s ^ x) (P) | Vj<i U~ s ^ x) (P) V A) 
i=i 

> g L (^W(P)| V j>Si{x)+N U-i(P)VA) 
i=i 

t(x) 

> £(L(tf«<*)(P)|.A)-e/(41ogn)) 
i=i 

A; 

> J2(L{U S ^ X \P)\A) - e/(41ogn)) - (k - t{x)) In 
i=i 

We conclude that 



n. 



[ \l(v i£S{x) u-\p)\a) -Wl(u^(p)\a) 

< e/(41ogn) + A; - J td/j,<£. 



□ 



Combining Theorems 2.12 and 2.11 we reach the following conclusion. 

Theorem 2.13. Suppose T is a free and ergodic action of a countable 
amenable group V = {71, 72, • • •}, A is a T-invariant sub-a- algebra restricted to 
which the action of T is still free and relative to which the action of T is cpe, 
and lastly P is a finite partition. Given any e > there is an M so that for 
any k and any A-measurable function S(x) taking values in the k-point subsets 
o/T, if S is both uniform and M- quasi- spread then we will have 



h^esT^PlA) - \ £ L{T 1 -i{P)\A) 



< e. 
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Proof. Construct an action U of Z that has the same orbits as T and for 
which the orbit rearrangement is „4-measurable. For any S(x) we now obtain 
a V(x) taking values in the fc-point subsets of Z that must be uniform. Notice 
that 

L(V^ s{x) T y -i(P)\A)(x) = L(y teV{x) U'\P)\A)(x) 

and for almost every (a.e.) x, for each 7 6 S(x) there is a unique i G V(x) 
with Tj(x) = U l (x) and so 

L{T^{P)\A)=L{U-\P)\A). 

Thus we know 



l -L{y^ s T^{P)\A) -\Y, L(.t^(P)\A) 



76S 



U(V ieV U-\P)\A) L{U-\P)\A) 

From Theorem 2.11 we know that for any N if M is large enough then V 
will be iV-quasi-spread and Theorem 2.12 completes the theorem. □ 

In the case we are interested in T is in fact T\ x T2, the algebra A is the 
second coordinate algebra, P is measurable with respect to the first coordinate 
algebra and S is a constant choice of set. In this case notice that 



and 



L(V 76S T 7 -i(P)|^) = /i(V 76S T 7 -i(P)) 

L(T J - 1 (P)\A)=h(P). 
From this the proof of Theorem 2.3 is complete. 



3. Discrete amenable groups and spread sets 



In this section we lay out the basic ergodic theory of measure-preserving 
actions of countable and discrete amenable groups. This material can all be 
found in [7] or [4]. Our goal is to describe some particular structures that are 
invariants of orbit equivalence. As described in the introduction we will be 
"transferring" our work from an action of T to one of Z with the same orbits. 
We here establish a vocabulary for transferring certain calculations through 
this transference. The most important of these will be conditional entropy. 
The transference of these calculations will be made by showing that they can 
be made on towers. Our first step is to lay out the structure of Rokhlin towers 
as developed in [7]. These build from notions of invariance of sets in V. 
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Notions of essential invariance of finite subsets F C T is central to the 
study of the ergodic theory of discrete amenable group actions. Here is the 
fundamental definition used in [7]. 

Definition 3.1. Let 5 > 0. Let K C T be a finite set. A subset F C T is 
called (8, K)-invariant if 

^(KK-'FAF) 

— w — <6 - 

To say that a set F is sufficiently invariant means that for some (unspec- 
ified) 5 > and finite K C T, F is (5, F)-invariant. 

To say that a list of sets Fl , F2 , . . . , F& is sufficiently invariant is to say that 
for some 5 > and finite set K C T, setting Fo = F, for each j G {1,2,... , fc}, 
the set Fj is (5, F 7 _i)-invariant. 

The existence of (5, F)-invariant sets for all 5 > and finite sets K is 
equivalent to the amenability of the countable group V. 

For purely technical reasons a different but equivalent calculation of the 
degree of invariance of a set is more convenient for us. 

Definition 3.2. Let 5 > and K C T be a finite set. A subset F C T is 
called [K, 5] -invariant if 

#{7 G F : FF^V C F} > (1 - <5)#F 



One calculates that if F is (<5/(#F) 2 , F)-invariant, then F is [K, S]- 
invariant and conversely if F is [K, <5/(#F) 2 ]-invariant, then F is (5, K)- 
invariant. Hence when one says that either F or a list Fi, ■ ■ ■ , F& are sufficiently 
invariant one need not distinguish which of the two notions, Definition 3.1 or 
Definition 3.2 is meant. 

We now present the Ornstein- Weiss quasi-tiling theorem. 

Definition 3.3. A finite list of sets Hi,H2, ■ ■ ■ , C T, with id G F^j, for 
all i, is said to s-quasi-tile a finite set F C T if there exist "centers" Cjj, 
i = 1, 2, . . . , fc, j = 1, 2, . . . , and subsets Fjj C Hi such that: 

1. #H iJ >(l-e)#H i ,foTj = l,...,l(i), 

2. the HijCij C F are disjoint, and 

3. #(UH id aj) > (l-e)|F|. 

Theorem 3.4 ([7]). Given e > 0, i/iere exists iV = iV(e) such that in any 
countable discrete amenable group T, if Hi, . . . , Hn is any sufficiently invariant 
list of sets, then for any D C T that is sufficiently invariant (depending on the 
choice of Hi, . . . , Hn), D can be e-quasi-tiled by Hi, . . . , H^. 
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This theorem is the essential content of Theorem 6, 1.2 [7]. Our defini- 
tion of e-quasi-tiling is slightly different; weaker in that we do not ask that 
HiCij n HkCkj = 0, i 7^ k, and stronger in that we require HijCij C F. Ob- 
taining the latter from Theorem 6, 1.2 [7] is easy if D is sufficiently invariant 
and N is fixed. 

Suppose (X, B, n) is a standard probability space. Suppose T is a measure- 
preserving free action of T on X. It is useful to keep in mind that T is a function 
of two variables, i.e., is a map from r x X to X. For a finite set FCT and 
measurable subset A G B with fi(A) > 0, consider F x A C V x X. As a 
measure on F x i, put the direct product c x ji of counting measure c and 
/i. Consider the map T restricted to this rectangle Fx A On each level set 
g x A, T is one-to-one and measure-preserving. On any fiber set F x x, T is 
again one-to-one. We definitely do not expect T to be one-to-one on F x i. 
It is clear, though, that T is nonsingular and, at most, j^F to 1. 

For S C F x A, let 

= #{ 9 eF;(g,x)eS} 

and 

c s (x) = #S(x). 

Of course 

(c x ^(S) = J^c s (x)dn(x). 

Set 

cfS 1 ) = min cs(x). 

Definition 3.5. We say that F x A maps an e- quasi- tower if there exists 
a measurable subset S C F x A such that: 

1. T\s is one-to-one, and 

2. c(S)>(l-e)#F. 

We say -F x A maps a real-tower if T\f x a is one-to-one. 

The e-quasi-tower itself (or real-tower as the case may be) is the set 
T(F x A) C X. Notice that we may always assume T(S) = T{F x A). Also 
notice that if there exists an S C F x A, such that T is one-to-one on S and 
(c x fi)(S) > (1 — e 2 )(c x fi)(F x A), then there must exist an A' C A, with 
fi(A') > (1 - e)/u(^4), such that c s (x) > (1 - for all x G A'. Hence 

Fxi' maps to an e-quasi-tower. We will return to real-towers in a bit but 
for now we focus on e-quasi-towers, which are the core of the Rokhlin lemma 
of [7]. This is only a minor modification of Theorem 5, II. 2 of [7]. 
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Theorem 3.6 ([7]). Suppose T is a discrete amenable group. For any 
e > 0, there exist 5 > 0, K C T and N = N{e) such that for any sequence 
H\, . . . , Hn of [K, 5]- invariant subsets of T, and any free measure-preserving 
T-action T = {T 9 } 7g r, acting on (X,B,fj,), there exist sets Ai, . . . , An G B 
such that: 

1. each Hi x Ai maps to an e-quasi-tower TZi in X, 

2. for i / j, TZi n Kj = 0, and 

3. ^ =l TZi) > 1 - e. 

A collection of sets of the form {Hi x Ai}f =l satisfying (1), and (2) we say 
forms e-Rokhlin towers. We indicate (3) by saying the towers cover (1 — e) of 
X. If the Hi x Ai are real-towers satisfying (2) then we say {Hi, Ai} is a castle 
(again covering (1 — s) of X.) We will be using castles a lot as we proceed so 
we stress what they are. A castle is a collection of the form {Hi,Ai} where 
each pair Hi,Ai form a real-Rokhlin tower and the list of towers have disjoint 
images. We refer to the union of these images in X as the castle image. 

Notice that if {Hi x A{\ form an e-quasi-Rokhlin tower then in each of 
the Hi x Ai there is a subset Si with c(Si) > (1 — e)ffH,i. For each x £ A 
remember Si(x) = {7 : (7, x) € Si}. Partition Ai into subsets Ai.j on which the 
choice of set Si{x) is a constant Hij. Now ffH,^j > jfHi{l — e). The collection 
of sets {Hi j x Aij} forms a castle covering the same set as the original e- 
quasi-Rokhlin tower. If each Hi is (K, <5)-invariant, then the new H^ j are still 
(K, 5 + 2e)-invariant. Furthermore if the sets Ai were known to be measurable 
with respect to some invariant sub-a-algebra then the bases of the castle could 
also be selected to be measurable with respect to that invariant algebra. 

We now state and prove a castle based version of the mean ergodic theorem 
for actions of countable amenable groups. This fact is not difficult to prove. 
We choose a version of the proof which is parallel to our proof of Theorem 2.6 
as a warm-up to that proof. 

For T an action of T, f a real- valued function, and {Ai, Hi} forming towers 
(real or quasi does not matter yet) we can ask how close averages of / across 
the towers approximate the integral of /, that is to say we can ask how small 
is 



It may appear that a normalization by is missing but the sum is 

normalizing each tower to have mass #Hin(Ai). 

The next lemma gives the pivotal observation, that if there exist towers 
where we are seeing convergence on average, then on all sufficiently invariant 
towers we must also be seeing such convergence. 
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Lemma 3.7. Suppose T is a free and ergodic action of the countable group 
T and f G Li(fx). Given any e > there is a 5 > so that if there exists a 
castle {Ai,Hi} covering all but 5 of X and satisfying 



E/jE /(r 7 (x)) - Jfd» 



dfi < 5, 



then for any sufficiently invariant Kj, if a castle {Bj, Kj} covers at least e of 
X we will have 



E/ E /(W) - # k j I f 



dfx 



dfi < e. 



Proof. Without loss of generality we can assume that f f d/j, = and 
hence that 




Also without loss of generality we can assume that there are only finitely many 
sets Hi and hence there is some finite K containing all of them. Choose 5\ so 
small that for any set C with n{C) < <5i we must have f c \ f\ d\i < e/2. Assume 
all the Kj are [K, Si/3] -invariant and be sure e{5\/3) 2 > 5. Let T C X be 
the tower image of {Ai,Hi} and suppose the castle {Bj,Kj} covers at least e 
of X. 

Consider the set Dj C Bj where 

#{ 7 G Kj : T 7 (s) i T} > ffK^/3. 
As 1 - S < n(T) < 1 - Ej V-{Dj)#Kj5i/?>, we must have 

j 

which is to say 

Y,n{Dj)#Kj >^fi(Bj)#Kj(l - V3) > e(l - 

j j 

For each Bj and each x e Dj consider the collection of group elements 

dj(x) = {7 G ifj : with T 7 (x) ^ T or is not contained in Kj}. 

For x G D| we know #dj(x) < 28 1 #K j /3. 

Define a global "bad" set D in the castle image by 

D = U( U : 7 G ^} U (J {T 7 (x) : 7 G 
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We calculate 

»(D) < Y i ^Dj)#K j + ^B j )#K j 28 1 /Z < 6 1 . 



Split / into two functions as / = fi + f 2 where 

h(x) = 

We know 



if x € D 
f(x) otherwise. 



EL E/* 



dfj, < e/2. 



To show the same for /i notice that 



E/ E /iCW 

3 • leKj 



d/j, 



d\x 



E/ DC E /CW) 

i J 7€<i,(rr) 

= E/ E (E/cvm) 

Ty(x)eA i 

< E/ E (E/ev(*)) 

^ eA e/( t 7^)) 

as 

U ( U 

is a disjoint union. This latter is less than e/2, finishing the result. 



dfj, 



d/i 



dfi 



□ 



Theorem 3.8. For T an ergodic action of the countable amenable group 
T and f G L±(fi) and e > for any sufficiently invariant castle {Ai, Hi} whose 
image covers at least e of X we will have 



W £ f(T 7 (x))-#Hi [fd» 



<i/x < e. 



Proof. From the previous lemma all we need to know is that for any 5 > 
there exist towers covering all but 5 of X and for which the estimate holds 
with 5 replacing e. Here is a very simple way to get them. Construct a direct 
product of T with some auxiliary free and ergodic action T\ of V so that the 
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direct product T x T\ is still ergodic. In this product consider only towers 
{Bj,Kj} that are measurable with respect to the second coordinate algebra. 
As the o"-algebra generated by the action of T on / is independent of the second 
coordinate algebra, if we know all the Kj are so invariant that from the mean 
ergodic theorem 



We will reuse this little trick of adding on a second coordinate in an even 
more decisive way later in proving Theorem 2.6. The next definition is dual to 
that of a set being well quasi-spread. For consistency we make the definition 
more parallel by fixing a listing of the elements of V = {71, 72, • • •}• 



Definition 3.9. We say that the castle {Hi x A{\ is N -invariant if setting 
K = {71, • • • ,1n} 



Notice that if each of the sets Hi is individually [K, l/N] invariant then the 
castle will be iV-invariant, but the invariance of the castle is an average notion 
and so l/N 2 invariance of the castle will imply [{71, . . . ,7jv} 5 1/iV] -invariance 
of all but l/N of the {Ai, Hi} (measured by their fraction of the castle image). 

We have already seen how a spread set transfers through an orbit equiv- 
alence to another action. We describe the parallel transference for castles. 
Suppose T is a free and ergodic action of a countable group T and U is a free 
action of T' with the same orbits. What lies behind the transference is of course 
the cocycle of the relation a : X xT ^ T f where T 7 (x) = U a ( x ^{x). It is useful 
to also give a name to the cocycle in the other direction j3 : X x r" — > V where 
Uy(x) = Tpr x ,yi\(x). Suppose {Ai, Hi} is a castle for the action of T. For each 
Ai and point x £ Ai consider the set of points T{x) = {T 7 (x) : 7 G Hi}. This is 
the slice through the tower image passing through x. Setting Ki(x) = a(x,Hi) 
we have 



Partition Ai into subsets Bij according to the set -fQ(x) = Kij. Now the full 
collection {-fQj, Bij} is a castle for the action of U. We refer to this as the 
image castle. It will be convenient to drop the double-indexing and write these 
towers simply as {Kj,Bj}. Notice that if both the orbit change from T to U 
and the sets Ai are measurable with respect to some invariant sub-u-algebra 
A then the sets Bj will automatically be ^l-measurable. 




then the tower estimate will automatically follow. 



□ 



£ #{7 G Hi : KK- 1 -) £ HijuiAi) < l/N. 



T(x) = {U 1 ,(x) : V € tfi(x)}. 
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We now verify Theorem 2.11 and the corresponding result for castles. The 
following result is not essential for our work but we include it as it is perhaps 
the most primitive of this type of result. 

Lemma 3.10. Suppose T and U are orbit- equivalent free actions of 
amenable groups T and F' with a and (3 the cocycles as described above. Given 
any K' C T' and 5' > there is a K C T and 5 > so that if F C T is 
[K, S]-invariant, then 

/J>({x : a(x,F) is [K' , S']-invariant}) > 1 — 5'. 



Proof. For x G X set n(x) = ${x,K'K'~ x ) C V. Choose the set K x so 

that 

H({x : k(x) C Ki}) > 1 - S'/3 

and set B = {x : k(x) C K{\. Choose K containing K\ and 5 < 5' /3 so that 
if F is [K, 5] -invariant then for all but a set of x of measure at most 5', the 
density in the set of points {T 7 (x) : 7 G F} of occurrences of the set B is 
at least 1 — <5'/3 (using the mean ergodic theorem.) Notice that F must be 
(Ki, (5/3)-invariant. For such an x set F' = a(x, F) and we get a lower bound 
for #{7'; K'K'~ X ^' C F'} by noting it must contain all a(x,j) where 7 G F, 
T 7 (x) G .B and K17 C F. Both of these subsets of F omit a fraction of at most 
573 from F. □ 

Theorem 3.11. Suppose T and U are orbit- equivalent free actions of 
amenable groups V and V with the cocycles a and (3 as described above. Given 
any 1 > a > and N, there is an M so that for all castles {Ai,Hi} for the 
action of T that are M-invariant and cover a of X, the image castle {Kj,Bj} 
relative to the action U must be N -invariant. 

Proof. We assume M is divisible by N and write it as M = M'N. As- 
suming {Ai, Hi} to be M-invariant there must be a subset of indices I so that 

^2 fi(Ai)#Hi > 1 — (the castle image) 

and for i £ I the set Hi is ({71, . . . , 7m'}> l/M')-invariant. For K' = {7J, . . . , 7^} 
choose K so that setting 

B = {x: fiix^K'K^ 1 )} C K 

we have n(B) > 1 — a(4iV 3 ) _2 . Let A\ C Ai consist of those x for which 

#{7 G # : T 7 (x) G -B} > (1 — ~^)#H l . 
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As J2i ^{Aij^Hi > a we must have 

i 

Choose M' > 2N 3 and so large that K C {71, . . . , 7m'}- Tnus for i & I 
Hi is [if, (2A^ 2 ) _1 ]-invariant and so for all i E I 

#{ 7 : K 7 C iJj} > (1 - g-^fli. 
For x G A[ consider k(x) = a(x , Hi) . Notice that 

#{7 G «(x) : /f'if' -1 g 

= #{7 = /?(*, 7') = 7 e Hi, 0(x, K'K'~ X i) 
= P(U Y (x),K'K'- 1 )(3(x, 1 ')£H t } 



#Hi 



< #{7' : 0(Uy(x),K'K' 1 )£K} + #{7 £H t :K 7 £ Hi} 



< #{ 1 £H i :T 1 {x)$B} + 

< 



2iV 3 

#Hi 



N 3 " 

This says for such an x the set is [K' , 1/N] -invariant completing the proof. 

□ 

The proof of Theorem 2.11 is completely parallel to this and we now 
present it. 

Proof of Theorem 2.11. Set K' = {7i,---,7at} and as in the previous 
proofs choose K so large that 

B = {x: f3{x,K') C K} 

satisfies 

fi(B) > 1 



(AN) 2 ' 

Choose M large enough that M > AN and so that K C {7^ . . . ,7m}- Sup- 
pose 5 : X — ► {the k point subsets of T} is both uniform and M-quasi-spread. 
Write 

T s{x) (x) = {V 1 (x),...,V k (x)} 

where the Vi are in the full-group of T (and hence are one-to-one and measure- 
preserving) and have distinct images. 
As n(B c ) < (4iV)" 2 , 

»{{x : #{* < : Vi(x) i B} > ^k}) < ^. 
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Set A = {x : #{i < k : Vi{x) £ B} < k/(4N)}. Consider those x G A for 
which there is a subset S'(x) C S(x) occupying a fraction 1 — 1/M of S(x) 
that is M-spread. This set has measure at least 1 - 1/M - 1/(4JV) > 1 - l/N. 
For such an x set 

s"( x ) = s'(x) n{ 7 eS: r 7 (x) g 5} 

and we note 

#S"(s) > #S'(x) - A > fc (i _ _L). 

In V(rr) = a{x, S(x)) set V (x) = a(x, S"(x)). For 7' G and 7 { G 

with 7^ / id, we need only show that 7(7' ^ V'(a;) to be finished. This is the 
same as saying 

(3{xM)iS'\x). 

Now 

/3(x,7W , ) = /3(^7'(^),7^)/3(x,7 , )• 

As i G Uy{x) G 5 and /3(x, 7') G S'(x). As 7' G K', /3([/y(x),70 G 

We must conclude, as S'(x) is M-spread, that /3(Uy( K x),j' 1 )P(x,j') cannot be 
in S'(x) which contains S"(x). □ 



4. Conditional entropy theory and proof of Theorem 2.6 

The Shannon-McMillan theorem of [7] states that if T is an ergodic action 
of r on (X, J 7 , 11) and P is a finite partition then for any 5 > if F C T is 

sufficiently invariant then among the atoms of V JL-i(P) is a collection G 

7G-F 

with: 

1) MUr,GG??) > 1 " S, 

2) < 2 W T ' P )+ 5 )# F , and 

3) for any r) G G, /*(»/) < 2 -W T < p )- 5 )# i? . 

We need a version of this fact relative to, (or conditioned on) a T-invariant 
sub-cr-algebra. To obtain this requires that we develop some basic condi- 
tional entropy theory for actions of countable amenable groups. T. Ward and 
Q. Zhang have given a development of conditional entropy for amenable group 
actions but their results (Theorem 3.2 and Corollary 3.3 of [9]) are not quite 
enough for our needs. Although it is possible to build from their development 
and get what we need for the sake of consistency and completeness we give the 
whole story here. 
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First two primitive observations about entropy that hold equally well for 
us here. If T is an action of V and F is some finite subset of T with 

Q C V T 7 -i(P) 

a finite partition then 

h(T, Q) < h{T,P). 

Next, for any vector p = {pi, . . . ,p t } of values pi > with J2Pi < 1 (y es J 
<) we can set pt+i = 1 — J2Pi an d let 

t+i 

#(p) = -Eft 1 °g(p«)- 
i=i 

Suppose P and P' are partitions of X labeled by the same symbols, i.e. 
P, P' : X — ► E where E is a finite set. We write 

PAP' = {x : P{x) / P'(x)}. 

The following inequality always holds: 

\h(T, P) - h(T, P')\ < H^PAP')) + /x(PAP') log(#E). 

In particular among E-valued partitions h(T, P) is uniformly continuous in the 
A •) metric. 

Definition 4.1. We abstract the notion in the conclusion of the Shannon- 
McMillan theorem of [7] as follows. For K C T a finite set and p a probability 
measure on T> K , we say p is (/i, 5)-flat if there is a subset So C T, K with: 

1) p(S ) >1-S, 

2) #S < 2( h+ ^# K , and 

3) for any rj G So, Aifa) < 2-( ft - <5 ) # ^. 

For a probability measure on T, K like p in this definition let 

The following are basic calculations. 

Lemma 4.2. If p satisfies 1) and 2) above then 

h K {p) < H{5) + <51og(#E) + (1 - S)(h + 5)<h + H(S) + <51og(#E). 

If p satisfies 1) and 3) afroue i/ien 

h K {p) > (1 - <5)(/i - <5) > /i - <5(log(#E) + 1). 
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Now for the technical fact that if one writes a "fiat" distribution on names 
as a convex combination of distributions without dropping the average entropy 
by much, then most of the terms in the convex combination must also be pretty 
flat. 

Theorem 4.3. Given h>0 and 8 > there is a Si so that if#K > 1/Si, 
p is (h, Si)- flat and we write p as a convex combination of measures 

p = J p t dt 

with 

/ h K (p t )dt > h K (p) - Si, 

then 

fj,({t : p t is (h, 5) -flat }) > 1 - 5. 



Proof. In this calculation we will use Fi(z) to represent a sequence of 
functions, all of which tend to as z — ► 0. We will find F4 so that for all but 
i*4(<$i) of the values t, pt is (h, i*4(<$i))-flat. 

Assume K and p are as described for h and 8\. Let So C T, K be the set 
of names making p an (h, <5i)-flat distribution. For all but \f8~1 of the values t, 

Pt(So) > 1 - \/S~i, 

for example, pt satisfies 1) and 2) of (h, v / Si)-flatness and hence 

h K (pt) <h + H{y/S[) + v^i"(log(#S) + 1). 

Thus there is a function F\ and for all but Fi{5\) of the values t, 

h K {pt) <h + Fi(Si). 

For all t we have hxipt) < log(#S) so as / hxipt) dt > h — 5i there is an 
F2 and for all but i^O^i) of the values t, 

h-F 2 (5i) < h K { Pt ) <h + F 2 (5 1 ) 

and 

Pt(So) >l~F 2 (Si). 

Suppose for such a pt there is a c > and a set Si C H K with pt(Si) = c 
and for any name n £ C 

p t (v) > 2-^* K . 
We want to see that c must be small. 
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Split E into three sets: 

1) Si of measure c, 

2) So\S\ of measure > 1 — 5± — c, and 

3) the rest, of measure < <5i. 

We calculate, conditioning on this three set-partition, that 

h K (pt) < ^jj^A + c(h - c ) + (l-S 1 -c)(h + S ± ) + ft log(#S) 

and 

h - F 2 (S 1 ) <h-c 2 + 5 1 log(#S) + -g- + Si 

and as #K > 1/S\, 

c 2 <F 2 (5 1 )+5 1 log(#E) + <5 1 (log3 + l). 
We conclude that there is an F3 with 

c < F 3 (<5i). 

Let St = So\Si and we have: 

1) Pt(St) >1-F 2 (S 1 )-F 3 (S 1 ), 

2) #St < #S < 2^ h+6 ^# K , and 

3) for r] G St, pt{v) < 2- ( - h - F - i( - Sl »# K . 

That is to say for F 4 (z) = F 2 {z) + F 3 (z) + z, p t is (h, F 4 ((5i))-flat. For 
5 > choose <5i so that F^di) < S to give the result. □ 

Of course the disintegration of the distribution envisioned in this result 
will arise from conditioning on a sub-cr-algebra. 

We now start to work with conditional entropy. Let A be a sub a-algebra 
and P a finite partition. For our purposes it is best to think of this partition 
as a map P : X — > E where E is a finite labeling set. The partition in the 
usual sense is then the collection of sets {P _1 (s)} se s- As described earlier by 
E(P\A) we mean the probability vector-valued function (E(l P -i/ s \\A)) s e^- 
By L(P\A) we mean the conditional entropy as an ^l-measurable function 

L(P\A){x) = H(E(P\A){x)) 

and now the conditional entropy of P given the u-algebra A is defined as 

h{P\A) = I L{P\A)d^. 
Jx 
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Lemma 4.4. ForT an ergodic action ofT, P and Q two finite partitions, 
and any 5 > if F is sufficiently invariant, then for all but S in measure of X 

E( V T 7 -i(P)| V T i(Q)) 

7€F 7G-F 

is (5, h)-flat where h = h{T, P V Q) - h(T, Q). 

Proof. From Corollary 3.0.26 of [4] which is a simple extension of the 
Shannon-McMillan theorem of [7] (§11.4 #5), if F is sufficiently invariant, then 
for all but (5/2) 2 in measure of the atoms rj of vyT 7 -i (P V Q) we have 

1) /i ( r? ) = 2 -(^m-PVQ)±(«5/2) 2 )#F 

and for all but 5/2 in measure of the atoms r/ G V^T 7 -i(Q), 

2) M (r/) = 2 -( /i ™±5/2)#F 

Consider those atoms 7/ satisfying 2) and all but a fraction <5/2 in measure 
covered by atoms 77 satisfying 1). For x in such an rj' and also in such an r\ we 
compute 



hence 



^l 7 V F r 7 - l( Q))(x) = ^ = 2-(^; 



E( V T i(F)| V T i(Q)) 



is (5, /i)-flat. □ 
Our next problem is to show that if F is sufficiently invariant then 

cannot be much smaller than 

h( V T V T 7 -i(Q))/#F. 

■yeF ' 7GF ' 

Theorem 4.5. For T an ergodic action of T, P and Q two finite parti- 
tions and 5 > 0, if F is sufficiently invariant then 

^LhiVT^ (P)l 7 V r T 7 -i (Q)) > h(T, P V Q) - h(T, Q) - 6. 



Proof. Suppose F C F' and notice that the calculation 

1 -M yj,-AP)\ yr r i(Q)) 



#F'" V 7GF 7 7 GF' 7 

can only decrease as F' increases as a set. Suppose the result is false, that is 
to say fails to hold for some Sq > 0. This would mean we will be able to find 
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arbitrarily invariant F's for which it fails. In particular for any <5i > we will 
be able to find Fi, . . . F/v which oVquasi tile all sufficiently invariant F's and 
a single F' finite with 

jTEtM V T 7 -!(P)| y ■ T^(Q)) < h(T,PV Q) - h(T,Q) - 5 /2. 

Choose F to also be [F' , Si] -invariant. Now we take a oVquasi tiling of F by 
Fi, . . . , F/v with centers Cij, i = 1, . . . , N and j = 1, . . . , k(i). Delete from the 
list of centers all Cjj with FjCjj not contained in F. The remainder of the tiles 
will still 25i-quasi tile F. 

We calculate with this an upper bound for 

1 ;/i(_VT 7 -i(P)| V T 7 -i(Q)) 



#F 7 eF 7 -yeF 

by successively adding on a conditional entropy of a span over a new tile of 
indices Fidj until they are exhausted and then adding a maximal bound for 
the added entropy of the remaining indices. This gives the inequalities 

^ ^fp-^KT, PVQ)~ HT, Q) - 5 /2) + 26, log(#S P ) 
< (1 + Si)(h(T, PVQ)- h(T, Q)) + 26, log(E P ) - <5 /2. 

If F is sufficiently invariant we conclude from Lemma 4.4 that 

(1) h(T,P\/Q)-h(T,P)-5 1 < (l + 5i)(h(T,P\/Q)-h(T,Q)) 

+ 25, log(Ep) - 5 /2 
for all values <5i > 0. But this conflicts with Sq > 0. □ 

Corollary 4.6. For any T-invariant sub-a- algebra A and 5 > 0, z/F 
is sufficiently invariant, for all but 5 in measure of the x G X, the conditional 
measure 

E{ VT rl (P)|i)(z) 



is (h{T,P\A),6)-flat. 
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Proof. Fix a partition Q. As the invariance of F improves the measures 
E( V T i(P)| y T 7 -i(Q))(s) 

become ever natter on ever more of X, and the amount the entropy can de- 
crease when further conditioning from V JL-i (Q) to V ZL-i (Q) becomes ever 

7 eF 7 7 er 7 

smaller. Thus Theorem 4.3 gives the result for A of the form V_T 7 -i (Q). Any 

invariant sub-a-algebra is an increasing span of such algebras Ai- The condi- 
tional entropies h( V T--i(P)\Ai)/#F decrease in i converging to 

h( v v(p)w/p 

This latter, as F becomes larger and more invariant, decreases to a value we 
call h(T,P\A). 

Thus for any 5 > 0, once i is large enough and F sufficiently invariant, 

^hivT^PU) > ^( T V F r 7 _ l( p)|A) - 5. 

Once F is sufficiently invariant for all but 5 in measure of the x £ X, we 
already know E{ V T 7 -i(P)|A) will be {h(T, P V Q,) - h(T, Qi), <5)-flat. Once 

more Theorem 4.3 gives the result. □ 



Corollary 4.7. ^4s the invariance of a set F increases to infinity, 
1 

#F~\eF' 



— L( y^r 7 -i(P)|^) /i(t,p|^). 



Proof. Notice that 
1 



L( V T„-i(P)|^)(x)<log(#S P ). 



#P 7GF 

The increasing flatness of the measures E( V T J -i(P)\A) now gives the result. 

□ 

The corollary above can be obtained quite easily as well from Theorem 
3.2 of [9] where it is shown that I( V T~-i(P)\A)/#F converges in L\ as the 

7GF 

invariance of F increases. 

Our next step is to show that conditional entropy can be found by ex- 
amining it on sufficiently invariant castles. This is the critical step in proving 
Theorem 2.6. Our approach is parallel to that of our proof of the ergodic 
theorem on castles, we will show that if it holds on some sequence of castles 
of growing invariance then it must hold on all such. To get the existence of a 
sequence we use the same trick as before, we add on another coordinate and 
note that then it is implied by the previous corollary. 
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To begin, for any finite subset H C T and A C X with the pair forming a 
tower we can calculate a local conditional entropy on the tower H x A as 

h (HA) (T,P\A) = J a L(V h T 7 -i(P)\A). 

Notice we have not normalized this by either /j,(A) or by #H. The reason is 
easy to understand from the inequalities 

h (HA) (T,P\A) < J #H\ogndii < logn#Hfi(A) 

where P is an n-set partition. That is to say this tower entropy is automatically 
bounded by the measure of the tower image times logn. For a castle T = 
{Hi,Ai} we can define a conditional entropy on the castle now as 

h r (T,P\A) = J2hh i ,A i )(T,P\A) 

i 

and again we see that this tower entropy will be bounded by the measure of 
the castle image times log n. 

Lemma 4.8. Suppose T is a free and ergodic action of the countable and 
amenable group F, A is an invariant sub-a -algebra and P is an n-set partition. 
Suppose T = {Hi,Ai} is a castle covering all but 5 > of X with Ai G A. 
Given any e > there is an N, so that for all castles T = {Kj, Bj} that are 
N -invariant with Bj £ A we will have 

h r >(T,P\A) < h T {T,P\A) + 5logn + e. 



Proof. We will abuse language a bit here and use /x(T) and fi(T') to 
represent measures of the two castle images. We assume now that the partition 
P and the castle T are given. For a castle T = {Kj,Bj} and x £ Bj define 

(2) d{x) = {7 G Kj : there exists an i with 

7 = 727D T 7l (x) G Ai, 72 G Hi and Hai C Kj} 

that is to say the collection of elements in Kj which lie in a slice through the 
T castle which lies completely inside this slice through the T'-castle. 

We note that if T is sufficiently invariant, by the ergodic theorem on 
towers and this invariance we will obtain 

e 



E 

3 



2 logn 



Define a function h on X by 

V •Tt-i(P)\A){x>) ifx = T 7 (x'), 7 eHi,x'eAi 

h(x) = ^ 

1 otherwise. 
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Notice that h(x) < logn and 

J h(x)dn = h r (T,P\A). 



Using the ergodic theorem on towers, once T is sufficiently invariant we 
will obtain 



J 

and in particular 



E/ E KT^xV-XKjhTCPlA) 



E / E ^CW) < M^. iV) + e/2- 

«/ Bn . 



For x G i?j for a first estimate 



L( V • T^{P)\A){x)<L{ V r 7 _ 1 (P)|^l)(x) + (#K J -#cZ(x))logn. 

7Gifj 7Gc((a;) 



Let di(x) = {7 G cZ(x) : T 7 (x) G A«} and now 

d(x)=\J (J U T 7 2 7lW 
* 71 eci, (a;) 72 

a disjoint union. 

Choose some ordering for each of the di{x) = {7^} and set 

Cj,fe = {7 G : 7 = 7i7i'fe' where 71 S i,', i' < i and fc' < k}. 

What this has done is simply to give an ordering to the slices through T 
that lie within the slice of T passing through x with C\ t k consisting of those 
elements in the slice that occur earlier in the order than the one through 7^. 

As both T and T are ^.-measurable we compute that 

/( V ■ T y -i(P)\A)(x) = E L (V/T V T 7 -i(P)V^)(T 7 . fc (x)) 

^ E L ( V T y -i(P)\A)(T^ k {x)) 
= E HT 7 (x)). 

■yed(x) 

We now complete our work by combining our estimates to give 

M (T, P\A) = E/ i( v V(P)Wd/i 

< E / L ( Y r 7 -i(P)|>i)(x)dM(^) 

+ E / i#Kj - #d{x))\agndii 
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< 



3 Bj -red(x) 




< 



< 



12 12 h(T J (x))dfi + 5logn + e/2 

3 JB iieK j 

h T (T,P\A) + 8\ogn + e. 



□ 



Corollary 4.9. For T an ergodic action of the countable and amenable 
group r, A a T-invariant sub-a- algebra and P an n-set partition, then for 
any e > there is an N and a 5 so that if T = {Hi,Ai} is a castle that is 
N -invariant and covers all but 5 of X, then 



Proof. We split the proof into two pieces. First, suppose that T possesses 
a sequence of castles % which are „4-measurable, whose invariance tends to 
infinity, and for which ^{Tj) — ► 1 and 



and using e/3 in the previous lemma we conclude that once T is sufficiently 
invariant and ^.-measurable we will have 



Now fixing such a T and again using the previous lemma with e/2, choose 
Tj with h Tj (T, P\A) > h(T, P\A) - e/2 and giving now 

h(T, P\A) < h Tj (T, P\A)+e/2< h T (T, P\A) +e/2 + 5 log n + e/2 

giving the result. 

Where will we find the castles %? We use the same trick as we did for 
the ergodic theorem on towers. Construct a direct product T x T2 that is still 
ergodic. We leave P fixed but replace A with A\ = A V 82- For any tower 
{H, A} that is immeasurable, 



h(T, P\A) + 5 log n + e < h T (T, P\A) < h{T, P\A) + e. 



h Tl (T,P\A)^h(T,P\A). 
Choose a % covering at least e/(31ogn) of X and so that 

h Ti (T,P\A) < h(T,P\A)+e/3 



h T (T, P\A) < h Ti (T, P\A) + 2e/3 < h(T, P\A) + e. 
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will converge to h(T,P\A). Thus if we choose a sequence of castles % which 
are £>2-measurable, whose invariance grows and which cover more and more 
of X x X2 we will have the sequence of castles we want. We now obtain the 
conclusion for all castles in the product action that are .Ai-measurable and in 
particular for those just in the first coordinate and „4-measurable. □ 

Proof of Theorem 2.6. From Theorem 2.11 we know that for any sequence 
of castles % for the action of T whose invariance grows to infinity, the image 
castles T? for the action of U will have the same property. If the % are A- 
measurable, as the orbit change is also ^.-measurable, the castles T{ will again 
be ^.-measurable. Hence we must have both 

lim h Ti (T,P\A) = h(T,P\A) 

and 

lim h T ,(U,P\A) = h(U,P\A). 

As the orbit change is ^.-measurable, for almost every x G X and finite 
set H C T, letting H' = a(x,H), 

L(^ H T, rl (P)\A)(x) = L(Vy eJr t/ 7 ,-i(P)|.A)(z); 
hence for all % we have 

h Ti {T,P\A) = h T ,(U,P\A) 
which completes the result. □ 

As promised in the introduction, we will finish by showing that in certain 
special cases it is easy to see why the relative Pinsker algebra of a direct 
product over its first coordinate is simply the span of the first coordinate and 
the Pinsker algebra of the second. In particular our argument will hold when 
the first coordinate action is Bernoulli. 

Theorem 4.10. Suppose T\ andT2 are two ergodic and measure-preserv- 
ing actions of T on the spaces (Xi,B±, fi±) and (X2, £>2;M2) respectively. Let 
II(Tj) represent the Pinsker algebra of T{ and £ represent the relative Pinsker 
algebra of the direct product T\ x T2 with respect to the first coordinate algebra 
B\. Suppose that C(Ti), the group of all measure-preserving bijections of X\ 
commuting with all elements ofT±, acts ergodically on X\. We conclude that 

£ = B 1 x n(T 2 ). 

Proof. Denote by £ X1 the trace of £ on the fiber x± x X 2 . Up to /x 2 - 
null sets we can identify £ Xl with the conditional expectation operator from 
L 2 (A^2, £>2, M2) to L-iiXi^ £ X1 , M2) which we write as E X1 . Using the strong 
operator topology on these conditional expectations, we get a map from X\ 
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into a subset of the closed subset of conditional expectation operators in this 
Polish space. Notice that for any function / G L 2 (X 2 , #2, fi 2 ) we can lift / to 
X\ x X 2 as f(xi,X2) = f{x2) and we have 

E Xl (f)(x 2 ) = E(f\£)(x 1 ,x 2 ). 

This shows that for any / the evaluation of E xi (f)(x 2 ) is jointly measurable 
in both variables and hence the map E X1 is a measurable map to this Polish 
(in particular separable metric) space. If S is an element of C{T\) the fact 
that S x id(£) = £ tells us that E$ Xl = E Xl for /^i-a.e. x\. The ergodicity of 
C(Ti) now implies that the operators E Xl must be /txi-almost surely constant, 
and hence the algebras £ X1 must be almost surely a constant £q. Since £>i is 
contained in £ we conclude that £ = B\ x £q and it is an observation that 
£0 = II(T2) as any set in £>2 with zero conditional entropy over B\ must have 
zero entropy for the action T 2 . □ 

For our purposes here all we need is a single example of such a map T\ 
for which we can guarantee the T\ x T 2 will remain ergodic for all ergodic T 2 . 
Here is perhaps the simplest such example. Let X = {0, l} r be the full 2-shift 
over T with the left action, i.e. 

T 7 (x 1 )( 7 ')=x 1 ( 7 - 1 7') 

and let fi\ be independent and identically distributed measure (l/2,l/2) r . 
Notice that C (Ti ) will contain the right action of V 

S 1 {x 1 ){ 1 ')=x 1 { 1 ' 11 ). 

The action S is just as ergodic as that of T. 

Corollary 4.11. IfT\ acting on {X\,B\,n{) is Bernoulli andT 2 acting 
on (X 2 , B 2 , fi 2 ) is cpe then the product action T\ x T 2 is relatively cpe over X±. 



5. Final comments 

Recent work in this area has not focused on proving that completely pos- 
itive entropy implies multiple mixing, but rather on the characterizing the 
unitary theory of such systems. In particular the goal is to show that the 
spectral type of any system of a completely positive entropy action of T is that 
of the shift on £ 2 of the group T. This has been shown for Z n by B. Kaminsky 
and for the discrete rationals by J. -P. Thouvenot. Is it possible to approach 
this question using the orbit equivalence notions presented here? That is to 
say can one pull the relativized spectral structure through an orbit equivalence 
to show that the result is simply the transference of the classical fact for Z? 
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Having restricted our attention here to countable amenable groups one 
must ask whether the methods here will apply to continuous such groups. The 
answer is certainly going to be yes. For groups that possess a cocompact 
countable subgroup the result on the subgroup together with some coding 
approximations will give the result for the larger group. For groups which do 
not possess such a subgroup the path is a bit trickier. Remember that central 
to our proof was the taking of a direct product with a standard second action. 
We once more do this but in this standard action we also select a section. The 
induced equivalence relation on this section will be hyperfinite of type Hi, that 
is to say orbit-equivalent to a measure-preserving action of Z. It is through 
this orbit equivalence that we will transfer the relative cpe property and mixing 
properties to obtain our result. This work will be presented separately. 

We end with a tantalizing open question. An area of the ergodic theory 
of discrete amenable groups that remains very cloudy is the nature of F0lner 
sequences along which the pointwise ergodic theory holds for all / G L 1 . We 
pose the following question: Suppose T is some fixed action of the countable 
and discrete amenable group T and U is an action of Z with the same orbits. 
For a.e. x G X Lemma 3.10 tells us the sequence of sets F n (x) = {7 G T : 
T 7 (x) = U{(x), —n < i < n} is a F0lner sequence in V. Is it the case that for 
a.e. x 6 X this sequence is universally good for the pointwise ergodic theorem? 
What is easily seen from our transference is that for any action S of T on a 
space Y and any / G L l (Y) that for almost every x £ X, the sequence F n {x) 
is good for the pointwise ergodic theorem for the S action on /. 
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